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What is claimed is : 

1. A document searching apparatus for 
searching a document group having a link relation 

5 for a document, comprising: 

a link importance assigning unit weighting the 
link relation and assigning link importance which 
indicates importance of the document based on the 
weighted link relation to the document; and 
10 an accessing unit accessing the document based 

on the link importance. 

2. The document searching apparatus as set 
forth in claim 1, 

15 wherein said link importance assigning unit 

includes : 

a URL similarity calculating unit calculating 
a URL similarity that is a similarity of URLs 
(Uniform Resource Locators) that represent the 
20 document, 

wherein said link importance assigning unit 
calculates the link importance based on the URL 
similarity and the link relation of the document. 



25 



3. The document searching apparatus as set 
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forth in claim 1, further comprising: 

a keyword extracting unit extracting text 
parts from the documents and extracting a keyword 
from the document contents. 

5 

4. The document searching apparatus as set 
forth in claim 3, 

wherein said keyword extracting unit 
calculates an occurrence frequency of the keyword 
10 in the document, and 

wherein said keyword extracting unit further 
comprises : 

a keyword - document correlation calculating 
unit calculating the correlation of the keyword and 
15 the document based on the link importance and the 
occurrence frequency of the keyword. 

5. The document searching apparatus as set 
forth in claim 4, further comprising: 

20 a monitoring unit monitoring accesses from a 

user and generating an access log, and 

wherein said keyword - document correlation 
calculating unit calculates the correlation based 
on the keyword occurrence frequency, the link 

25 importance, and the access log. 
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6. The document searching apparatus as set 
forth in claim 4, further comprising: 

a document type determining unit determining a 
document type of the document based on the URL 
similarity, the number of links from the document, 
and the number of links to the document, 

wherein said keyword - document correlation 
calculating unit selects the document based on the 
document type and calculates the correlation for 
the selected document. 

7. The document searching apparatus as set 
forth in one of claims 4, further comprising: 

an index creating unit creating an index for 

accessing the document corresponding to 

pronunciation characters or spelling of the 
extracted keyword. 

8. The document searching apparatus as set 
forth in claim 7, further comprising: 

a selecting unit allowing the user to select a 
portion of the pronunciation characters or spelling 
of the keyword, 

wherein said index creating unit places less 
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than a predetermined number of documents highly 
correlated with the keyword in the index based on 
the correlation calculated by said keyword 
document correlation calculating unit, and 
5 wherein said accessing unit accesses the 

document based on the selected keyword. 

9. The document searching apparatus as set 
forth in one of claims 1, further comprising: 

10 a collecting unit collecting the document from 

a network. 

10. The document searching apparatus as set 
forth in claim 1, 

15 wherein said link importance assigning unit 

causes the weight of the link relation between the 
documents with a high URL similarity to be 
decreased. 

20 11. The document searching apparatus as set 

forth in claim 1, 

wherein said link importance assigning unit 
causes the document that is linked from important 
document and whose URL similarity is low to be 

2 5 important. 
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12. The document searching apparatus as set 
forth in claim 1, 

wherein said link importance assigning unit 
causes the importance of document linked from many 
document whose URL similarity are high to be 
decreased. 

13. The document searching apparatus as set 
forth in claim 1, 

wherein the link importance of each documant 
is defined as a solution of the following 
simultaneous linear equation (1) , assuming that C q 
is constant (the lower limit of the importance that 
depends on each page) for each p e DOC and that when 
a page p is linked to a page q, the link weight lw 
(p, q) is defined by the formula (2) : 

Wq = Cq+ ••• (1) 

peRefed(q) 

lw(p,g) = diff(p,g)I Y J dW(p,i) = 1 J 

ieR ef( P ) sim(p,q) X . , 

,J^p) sim(p,i) 

... (2) 

where DOC = {pi, p2, pN} is a set of 

documents calculated for the link importance; Wp is 
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the link importance of the page p; Ref(p) is a set 
of pages linked from the page p; Refed(p) is a set 
of pages linking to the page p; sim(p, q) is the 
URL similarity of the pages p and q; dif f (p, q) = 
5 l/sim(p, q) is the difference. 

14. The document searching apparatus as set 
forth in one of claims 1, 

wherein the URL similarity is determined based 
10 on characters of a URL containing a server address. 

15. A document index creating apparatus for 
creating an index of a document group having a link 
relation, comprising: 

15 a link importance assigning unit assigning a 

link importance to the document based on the link 
relation; 

a keyword extracting unit extracting a keyword 
from the document; 
20 an index creating unit creating an index for 

accessing the keyword based on pronunciation 
characters or spelling of the extracted keyword; 
and 

an accessing unit accessing document assigned 
25 the link importance corresponding to the keyword 
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when the pronunciation characters or spelling of 
the keyword are selected from the index. 

16. The document index creating apparatus as 
set forth in claim 15, 

wherein said link importance assigning unit 
includes : 

a URL similarity calculating unit calculating 
a URL similarity that is a similarity of URLs 
(Uniform Resource Locators) that represent the 
location of the documents in a netowork, 

wherein said link importance assigning unit 
calculates the link importance based on the URL 
similarity and the link relation of the document. 

17. A document index creating apparatus for 
creating an index of a document group having a link 
relation, comprising: 

a link importance assigning unit assigning a 
link importance to the document depending on 
whether or not URLs of the documents are similar; 

a keyword extracting unit extracting a keyword 
from the document; and 

an index creating unit creating an index for 
accessing the document corresponding to 
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pronunciation characters or spelling of the 
extracted keyword based on the link importance. 

18. A link list creating system for creating a 
link list for a document group having a link 
relation, comprising: 

a collecting unit collecting the documents 
from a network; 

a link importance assigning unit assigning a 
link importance as of the document an importance 
calculated based on the link relation to the 
document ; 

a URL character string determining unit 
determining a URL having a particular 
characteristic of a character string from the 
documents; and 

an index creating unit creating a link list 
for listing less than a predetermined number of 
links to the documents based on the link importance 
and the particular characteristic of the character 
string of the URL. 

19. The link list creating system as set 
forth in claim 18, further comprising: 

a document type determining unit determining a 



document type of the document based on a URL 
similarity reperesenting the similarity of between 
URLs of the documents, the number of links to the 
document, and the number of links from the document, 
wherein said index creating unit selects the 
document based on the document type and creates the 
link list of the selected document. 

20. A document searching method for searching 
a document group having a link relation for a 
document, comprising: 

assigning a link importance as an importance 
of the document calculated with weighting the link 
relation to the document; and 

accessing the document based on the link 
importance . 

21. The document searching method as set forth 
in claim 20, further comprising: 

calculating a URL similarity that is a 
similarity of URLs (Uniform Resource Locators) that 
represent the document; and 

calculating the link importance based on the 
URL similarity and the link relation of the 
document . 



22. The document searching method as set forth 
in claim 20, further comprising: 

extracting a keyword from the document. 

23. The document searching method as set forth 
in claim 20, further comprising: 

calculating an occurrence frequency of the 
keyword in the document, and 

calculating the correlation of the keyword and 
the document based on the link importance and the 
occurrence frequency of the keyword. 

24. The document searching method as set forth 
in claim 23, further comprising: 

monitoring accesses from a user and generating 
an access log; and 

calculates the correlation based on the 
keyword occurrence frequency, the link importance, 
and the access log. 

25. The document searching method as set forth 
in claim 23, further comprising: 

determining a document type of the document 
based on the URL similarity, the number of links to 
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the document, and the number of links from the 
document; and 

selecting the document based on the document 
type and calculating the correlation of the 
5 selected document. 

26. The document searching method as set forth 
in one of claims 22, further comprising the step 
of: 

10 creating an index for accessing the document 

corresponding to pronunciation characters or 
spelling of the extracted keyword. 

27. The document searching method as set forth 
15 in claim 26, further comprising the steps of: 

placing less than a predetermined number of 
documents which are correlated with the keyword in 
the index; 

selecting a portion of the pronunciation 
20 characters or spelling of the keyword; and 

accessing the document corresponding to the 
selected portion of the pronunciation characters or 
spelling of the selected keyword. 
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28. The document searching method as set forth 
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in one of claims 20, further comprising the step 
of: 

collecting the document from a network. 

29. A link list creating method for creating a 
link list for a document group having a link 
relation, comprising the steps of: 

colleting the document from a network; 

assigning a link importance which indicates 
inmortance of the document to the document based on 
the link relation; 

determining a URL having a particular 
characteristic of a character string from the URLs 
of each document; and 

creating a link list for listing less than a 
predetermined number of links to the document based 
on the link importance and the particular 
characteristic of the character string of the URL. 

30. The link list creating method as set forth 
in claim 29, further comprising the steps of: 

determining a document type of the document 
based on the URL similarity, the number of links to 
the document, and the number of links from the 
document; and 
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selecting the document based on the document 
type and creating the link list for the selected 
document based on the link importance and the 
particular characteristic of the character string 

of the URL. 

31. A computer readable record medium for 
recording a program that causes a computer to 
execute a process for creating a link list for a 
document group having a link relation, the program 
comprising the steps of: 

colleting documents from a network; 

assigning a link importance which indicates 
inmortance of the document to each document based 
on the link relation; 

determining a URL having a particular 
characteristic of a character string from the URLs 
of documents; and 

creating a link list for listing less than a 
predetermined number of links to the documents 
based on the link importance and the particular 
characteristic of the character string of the URL. 



